Alibaba Unveils Advanced Qwen3-Next AI Models on NVIDIA Platform
Alibaba has introduced two new open-source AI models, Qwen3-Next 80B-A3B-Thinking and Qwen3-Next 80B-A3B-Instruct, featuring a hybrid Mixture of Experts (MoE) architecture. The models are designed to improve efficiency and performance, particularly when deployed on NVIDIA's accelerated computing platform. The architecture activates only 3 billion of the total 80 billion parameters per token, combining the capacity of large-scale models with the efficiency of much smaller ones.
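To illustrate how such sparse activation works in general, the following is a minimal sketch of top-k MoE routing, where a gate scores every expert but only a handful are actually run for each token. The expert count, hidden size, and k used here are illustrative placeholders, not the published Qwen3-Next configuration.

```python
# Minimal sketch of sparse Mixture-of-Experts routing (illustrative values only).
import numpy as np

def moe_forward(token: np.ndarray, experts: list, gate_w: np.ndarray, k: int = 2) -> np.ndarray:
    """Route one token through only the top-k experts chosen by the gate."""
    logits = token @ gate_w                      # score every expert for this token
    top_k = np.argsort(logits)[-k:]              # keep the k best-scoring experts
    weights = np.exp(logits[top_k])
    weights /= weights.sum()                     # normalize gate weights over the selected experts
    # Only the selected experts execute, so most parameters stay inactive per token.
    return sum(w * experts[i](token) for w, i in zip(weights, top_k))

# Illustrative setup: 64 tiny experts, each a random linear map.
rng = np.random.default_rng(0)
d, n_experts = 16, 64
experts = [lambda x, W=rng.normal(size=(d, d)): x @ W for _ in range(n_experts)]
gate_w = rng.normal(size=(d, n_experts))
out = moe_forward(rng.normal(size=d), experts, gate_w, k=2)
```

Because only k experts run per token, compute scales with the active parameter count rather than the full model size, which is the trade-off the 3B-of-80B figure describes.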
Optimized for long context lengths, the models can process inputs of more than 260,000 tokens. They leverage NVIDIA's fifth-generation NVLink on Blackwell, which provides 1.8 TB/s of direct GPU-to-GPU bandwidth, significantly reducing latency and improving token throughput during complex tasks. The models incorporate 48 layers, with every fourth layer featuring architectural enhancements.
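The repeating every-fourth-layer pattern can be sketched as below. The layer-type names are placeholders under the assumption that the enhanced layer is a heavier attention block interleaved among lighter ones; they are not Qwen3-Next identifiers.

```python
# Hedged sketch of a 48-layer stack where every fourth layer uses an enhanced block.
def build_layer_pattern(num_layers: int = 48, period: int = 4) -> list:
    """Return a layer-type list where each `period`-th layer is the enhanced one."""
    return [
        "enhanced_block" if (i + 1) % period == 0 else "standard_block"
        for i in range(num_layers)
    ]

pattern = build_layer_pattern()
print(pattern[:8])                          # first two repeating groups of four
print(pattern.count("enhanced_block"))      # 12 of the 48 layers are enhanced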